Concepedia

corpus analysis

Publications: 11.8K

Citations: 758.4K

Authors: 19.7K

Institutions: 3.4K

Probabilistic Corpus Distribution Modeling

1975 - 1981

During this period, corpus analysis centered on probabilistic modeling of word usage and term discovery across large text collections. Researchers advanced automatic keyword indexing by using word-distribution and Poisson-based models to separate specialty terms from general vocabulary, shaping core applications in corpus-based information retrieval. A parallel thread examined discourse variation and the distributional properties of form frequencies, applying probabilistic indices and sampling-aware methods to assess reliability across corpora.

Representativeness-Driven Corpus Analytics

1982 - 1998

Corpus-driven Statistical NLP

1999 - 2005

Cross-Task Distributional Corpus Semantics

2006 - 2010

Platform-Driven Corpus Analytics

2011 - 2017

Cross-Lingual Multimodal Embeddings

2018 - 2024